Study of MPEG-7 Sound Classification and Retrieval

Authors

  • Hyoung-Gook Kim
  • Edgar Berdahl
  • Thomas Sikora
Abstract

In this paper, we compare three audio taxonomy methods for MPEG-7 sound classification. The MPEG-7 sound classification and indexing tools consist of both low-level and high-level description schemes. For the low-level descriptors that we use, low-dimensional features based on spectral basis descriptors are produced in three stages: normalized audio spectrum envelope, principal component analysis, and independent component analysis. High-level description schemes are then used to describe the modeling of audio features, the classification procedure, and retrieval. For classification we test three approaches: the direct approach, the hierarchical approach without hints, and the hierarchical approach with hints. Our experimental results show that the hierarchical approach with hints performs best, with a classification accuracy of around 99%. The direct approach produces the second-best results, and the hierarchical approach without hints the third-best.
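The three classification approaches named in the abstract can be contrasted with a small sketch. The abstract does not define them in detail, so the following is only an illustration under stated assumptions: the taxonomy has two levels, each class has a model score for the query sound (higher is better), and a "hint" means the top-level group is supplied externally instead of being inferred. The taxonomy, the class names, the scores, and the function names `classify_direct` and `classify_hierarchical` are all hypothetical and do not come from the paper.

```python
from typing import Dict, List, Optional

# Hypothetical two-level taxonomy (group -> leaf classes); names are illustrative only.
TAXONOMY: Dict[str, List[str]] = {
    "speech": ["male_speech", "female_speech"],
    "music": ["violin", "piano"],
}

# Made-up scores for one query sound (higher is better), e.g. log-likelihoods.
# The query is imagined to be male speech, but the group-level model prefers "music".
GROUP_SCORES: Dict[str, float] = {"speech": -9.0, "music": -8.5}
LEAF_SCORES: Dict[str, float] = {
    "male_speech": -10.2,
    "female_speech": -11.0,
    "violin": -10.5,
    "piano": -12.0,
}


def classify_direct(leaf_scores: Dict[str, float]) -> str:
    """Direct approach: choose the best-scoring leaf class among all classes."""
    return max(leaf_scores, key=lambda name: leaf_scores[name])


def classify_hierarchical(
    group_scores: Dict[str, float],
    leaf_scores: Dict[str, float],
    taxonomy: Dict[str, List[str]],
    hint: Optional[str] = None,
) -> str:
    """Hierarchical approach: fix the group first, then choose a leaf inside it.

    Assumption: with a hint, the group is given; without one, it is the
    best-scoring group, so a wrong group decision cannot be undone later.
    """
    if hint is not None:
        group = hint
    else:
        group = max(group_scores, key=lambda name: group_scores[name])
    return max(taxonomy[group], key=lambda name: leaf_scores[name])


print(classify_direct(LEAF_SCORES))
print(classify_hierarchical(GROUP_SCORES, LEAF_SCORES, TAXONOMY))
print(classify_hierarchical(GROUP_SCORES, LEAF_SCORES, TAXONOMY, hint="speech"))
```

With these made-up scores, the hierarchical approach without a hint commits to the wrong group and can no longer reach the correct leaf, while the direct approach and the hinted approach both recover it. This is one plausible reading of why the approach without hints ranks last in the reported results, not a confirmed explanation from the paper.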


Similar resources

General sound classification and similarity in MPEG-7

We introduce a system for generalised sound classification and similarity using a machine-learning framework. Applications of the system include automatic classification of environmental sounds, musical instruments, music genre and human speakers. In addition to classification, the system may also be used for computing similarity metrics between a target sound and other sounds in a database. We...


Sound Effects Taxonomy Management in Production Environments

Categories or classification schemes offer ways of navigating and higher control over the search and retrieval of audio content. The MPEG-7 standard provides description mechanisms and ontology management tools for multimedia documents. We have implemented a classification scheme for sound effects management inspired by the MPEG-7 standard on top of an existing lexical network, WordNet. WordNet...


Performance of MPEG-7 spectral basis representations for retrieval of home video abstract

In this paper, we present a classification and retrieval technique targeted for retrieval of home video abstract using dimension-reduced, decorrelated spectral features of audio content. The feature extraction based on MPEG-7 descriptors consists of three main stages: Normalized Audio Spectrum Envelope (NASE), basis decomposition algorithm and basis projection, obtained by multiplying the NASE ...


Parameter-Based Categorization for Musical Instrument Retrieval

In the continuing goal of codifying the classification of musical sounds and extracting rules for data mining, we present the following methodology of categorization, based on numerical parameters. The motivation for this paper is based upon the fallibility of Hornbostel and Sachs generic classification scheme, used in Music Information Retrieval for instruments. In eliminating the redundancy a...


Defect Image Classification and Retrieval with MPEG-7 Descriptors

In this paper the visual content descriptors defined by the MPEG-7 standard are applied to defect image classification and retrieval. A pre-classified defect image database is used in evaluation. The experiments are done with a KNN classifier and with a PicSOM content-based image retrieval system. Results indicate that the MPEG-7 features work with a high level of success, especially the Color ...




Publication date: 2003